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ABSTRACT 

Exposures  to  many  unconventional  pathogenic  agents 
result  in  flu-like  illness  that  are  initially  indistinguishable 
from  common  respiratory  illnesses  and  early  diagnosis  to 
distinguish  among  the  severe  vs  common  viral  infections 
depends  on  pathogen  proliferation  to  dangerous,  near- 
untreatable  levels.  Assessing  exposure  to  a  pathogen,  in 
advance  of  onset  of  illness  or  at  various  stages  post¬ 
exposure,  is  invaluable  among  the  diagnostic  options. 
Lymphocytes  serve  as  “whistle  blower”  indicators  as 
they  encounter  pathogenic  agents  even  early  during  the 
course  of  infection,  registering  the  encounters  in  their 
mRNA  and  developing  patterns  of  expression  that 
correspond  to  each  specific  pathogen.  Time  series  of 
gene  expression  patterns  relate  to  the  stage  or  severity  of 
the  infection  and  are  unique  for  each  pathogen. 

We  are  using  the  host  blood  for  determination  of  whole 
genome  regulation  in  response  to  various  viral  agents  to 
extract  features  and  signatures  that  can  be  used  for  point- 
of-care  diagnosis  of  various  viral  infections  (common 
respiratory,  arena,  flavi-,  alpha-  and  other  viruses).  These 
data  also  have  the  potential  to  provide  stage-appropriate 
therapeutic  targets. 

These  studies  utilized  exposure  time  sequences  of  host 
gene  expression.  Series  #1  contained  common 
respiratory  viruses  (influenza  A,  parainfluenza, 
rhino  virus,  respiratory  synctical  virus). 

Series  #2  focused  on  5  arena  viruses  (the  highly  virulent 
hemorrhagic  virus,  Lassa,  and  2  additional  pairs  of 
virulent/avirulent  arena  viruses).  Series  #3.  includes 
flaviviruses  (West  Nile  virus  and  4  serotypes  of  Dengue), 
and  Series  #4,  2  alpha  viruses,  among  which  was 
Venezuelan  equine  encephalitis  (VEE).  The  “training 
sets”  were  constructed  from  in  vitro  exposures  to 
purified  peripheral  blood  leukocytes  from  -6  human 
leukapheresis  donors  for  each  virus  described  above. 
Numerous  modeling  /  mathematical  techniques  were 
applied  to  these  datasets  in  order  to  identify  signature 
patterns  indicative  of  each.  The  “shrink/grow”  modeling 
approaches  were  used  as  well  as  other  algorithms  that 
have  shown  success  for  signature  extraction.  For  the 
“grow”  algorithm,  genes  are  individually  selected  that 
have  the  best  discriminating  power  and  the  first  of  those 
frequently  show  properties  unique  for  specific  viral 
infections. 


Our  first  question  related  to  the  fact  that,  by  necessity 
(due  to  the  lack  of  human  samples  from  exposures  to 
biothreat  pathogens),  we  were  training  on  in  vitro 
exposures  to  these  many  pathogenic  agents.  Our  thesis 
was  that  such  an  exposure  would  provide  a  biochemical 
signature,  related  to  the  mechanistic  course  of  action  of 
each  pathogenic  agent  and  it  should  be  reflective  of  at 
least  early  events  from  in  vivo  exposures  (to  non-human 
primates-NHP). 

Our  first  “test”  data  set  was  from  host  gene  expression 
responses  in  peripheral  blood  (leukocytes)  from  NHP 
exposed  to  VEE.  This  is  a  mosquito-borne  viral  disease 
characterized  by  fever  and  one  or  more  of  the  following: 
severe  headache,  back  pain,  myalgias,  prostration,  chills, 
nausea,  vomiting,  weakness  and  other  flu-like  symptoms. 
The  aerosol  form  of  VEE  is  highly  infectious,  making 
VEE  a  potential  biowarfare  agent.  This  could  be 
especially  worrisome  if  strains  are  altered  genetically  to 
increase  pathogenicity.  If  this  virus  was  deployed 
efficiently,  it  could  incapacitate  thousands  of  people  for  a 
week  or  more  and  cause  untold  psychological  stress  as 
well  as  being  physically  debilitating. 

Diagnosis  of  VEE  relies  on  virus  isolation  from  acute 
phase  serum  or  from  spinal  fluid,  or  on  detection  of 
VEEV-specific  IgM  in  the  cerebrospinal  fluid  in  cases  of 
encephalitis.  We  carried  out  2  different  approaches  using 
blood  from  12-25  control  (unexposed)  NHP  along  with  8 
samples  from  VEE-exposed  NHP.  Although  no  signs  of 
illness  were  apparent  by  day  3,  gene  expression  patterns 
identified  these  NHP  as  having  mild  exposure  to  the 
virus.  To  increase  the  level  of  confidence  in  the 
methodology,  the  NHP  samples  from  12  control  and  5 
VEE  challenged  NHP  were  blinded  as  “test”  datasets.  All 
of  the  17  samples  were  correctly  categorized  as  to  the 
nature  of  the  exposure.  Using  both  types  of  training/test 
methods,  one  control  NHP  was  categorized  correctly  as  a 
“control”  but  had  at  least  some  indicators  of  common 
viral  exposure. 

Additional  studies  are  underway  to  apply  the  most 
instructive  of  these  algorithms  to  arena  viruses  (NHP 
exposed  to  Lassa  virus),  people  exposed  to  dengue  virus 
or  West  Nile  virus  or  influenza  A. 

1.  INTRODUCTION 

Preparations  for  the  Army  of  the  future  include  reliance 
on  high  technology  instrumentation  for  diagnostic  and 
therapeutic  approaches.  This  highly  sophisticated 
hardware  requires  extensive  computational  capabilities 
and  will  have  the  potential  to  provide  a  wealth  of 
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important  information  to  aid  the  warfighter.  Our 
laboratory  is  focused  on  utilizing  gene  microarray 
technology  as  a  tool  for  establishing  rapid  diagnostics 
and  tailored  therapeutic  approaches. 

We  are  using  human  peripheral  mononuclear  cells 
(PBMC)  to  study  whole  genome  regulation  in  response 
to  various  viral  agents  and  to  identify  diagnostic 
biomarkers  for  these  viruses.  We  apply  various 
algorithms  to  extract  features  and  signatures  that  can  be 
used  for  point-of-care  diagnosis  of  various  viral 
infections  (common  respiratory,  arena,  flavi-,  alpha-  and 
other  viruses)  (Figure  1). 
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We  have  identified  host  gene  expression 
patterns  that  can  discriminate  exposure  to  various 
biological  threat  agents.  Each  of  these  gene  patterns 
regulated  by  a  specific  agent  reveals  the  cascade  of 
events  that  occurs  after  the  host  encounters  a  pathogenic 
agent.  Even  though  these  pathogens  initially  cause 
similar  symptoms,  such  as  malaise,  fever,  headache,  and 
cough,  the  course  of  illness  induced  by  each  of  them 
differs  in  time  frame  of  illness  patterns.  Using  these 
signature  gene  profiles  to  assess  possible  exposure  to 
pathogenic  agents  or  to  differentiate  them  from  non 
lethal  illnesses  when  the  classical  identification  of  a 
pathogen  is  not  conclusive  may  fill  a  gap  in  the  arsenal 
of  diagnostic  tools. 

We  have  studied  the  kinetics  of  host  responses  to  various 
groups  of  viruses.  The  first  group  is  common  respiratory 
viruses  (influenza  A,  parainfluenza,  rhinovirus, 
respiratory  synctical  virus). 

Second  group  constitute  of  5  arena  viruses  (the  highly 
virulent  hemorrhagic  virus,  Lassa,  and  2  additional  pairs 
of  virulent/avirulent  arena  viruses).  The  third  group 
includes  flaviviruses  (West  Nile  virus  and  4  serotypes  of 
Dengue),  and  the  forth  group  include  2  alpha  viruses, 
among  which  was  Venezuelan  equine  encephalitis 
(VEE). 

We  have  used  the  microarray  data  from  the  in  vitro 
exposures  to  various  viral  agents  as  a  “training  set”.  This 
set  was  assembled  using  data  from  PBMC  obtained  from 


~6  human  leukapheresis  donors  for  VEE  and  each  of  the 
respiratory  and  arena  viruses. 

We  applied  various  algorithms  to  these  datasets  in  order 
to  identify  signature  patterns  indicative  of  each. 

Due  to  the  lack  of  human  samples  from  exposures  to 
biothreat  pathogens,  we  are  using  in  vitro  exposures  to 
these  many  pathogenic  agents.  Our  previous  studies  have 
shown  that  high  throughput  gene  expression  data 
obtained  in  vitro  can,  to  some  extent,  represent  in  vivo 
exposures.  Our  thesis  was  that  such  an  exposure  would 
provide  a  biochemical  signature,  related  to  the 
mechanistic  course  of  action  of  each  pathogenic  agent 
and  it  should  be  reflective  of  at  least  early  events  from  in 
vivo  exposures  (to  non-human  primates-NHP). 

We  examined  our  hypothesis  that  in  vitro  exposures  can 
predict  exposures  in  vivo.  We  have  used  data  from 
PBMC  exposed  to  Venezuelan  equine  encephalitis 
(VEE)  in  vitro  and  in  vivo  using  NHP. 

Venezuelan  equine  encephalitis  is  a  mosquito-borne  viral 
disease  characterized  by  fever  and  one  or  more  of  the 
following:  severe  headache,  back  pain,  myalgias, 
prostration,  chills,  nausea,  vomiting,  weakness  and  other 
flu-like  symptoms.  The  aerosol  form  of  VEE  is  highly 
infectious,  making  VEE  a  potential  biowarfare  agent. 
This  could  be  especially  worrisome  if  strains  are  altered 
genetically  to  increase  pathogenicity.  If  this  virus  was 
deployed  efficiently,  it  could  incapacitate  thousands  of 
people  for  a  week  or  more  and  cause  untold 
psychological  stress  as  well  as  being  physically 
debilitating. 

Diagnosis  of  VEE  relies  on  virus  isolation  from  acute 
phase  serum  or  from  spinal  fluid,  or  on  detection  of 
VEEV-specific  IgM  in  the  cerebrospinal  fluid  in  cases  of 
encephalitis. 

We  carried  out  microarray  gene  expression  analysis  to 
identify  biomarkers  for  VEE.  For  the  in  vitro  studies,  we 
exposed  PBMC  for  6  healthy  donors  to  VEE  and  RNA 
samples  were  isolated  at  various  time  point  post 
exposure.  For  the  in  vivo,  we  carried  out  2  different 
approaches  using  blood  from  12-25  control  (unexposed) 
NHP  along  with  8  samples  from  VEE-exposed  NHP. 
Although  no  signs  of  illness  were  apparent  by  day  3, 
gene  expression  patterns  identified  these  NHP  as  having 
mild  exposure  to  the  virus. 

We  have  used  microarray  data  from  the  in  vivo 
exposures  to  apply  predictive  modeling  and  to  identify 
genes  that  can  be  predictors/signatures  of  exposures  to 
VEE.  We  identified  genes  that  can  be  used  as  diagnostic 
biomarkers  for  VEE. 


We  have  then  used  these  data  from  VEE  exposure  and 
data  from  exposures  to  respiratory  and  arena  viruses  to 
apply  feature  extraction  and  predictive  modeling.  These 
data  were  used  as  a  training  set  and  the  data  from  the  in 
vivo  NHP  exposures  were  used  as  a  test  set.  Applying 
feature  extraction  and  predictive  modeling  algorithms, 
we  were  able  to  correctly  predict  the  VEE  exposures  in 
>90%  of  the  NHP  samples. 

We  have  also  used  samples  from  the  NHP 
controls  and  VEE  challenged  that  were  blinded  as  “test” 
datasets.  All  of  the  blinded  samples  were  correctly 
categorized  as  to  the  nature  of  the  exposure. 

Using  the  whole  genome  expression  profiles, 
we  were  able  to  identify  biomarkers  that  would 
distinguish  VEE  from  arena  and  respiratory  viruses  and 
to  determine  exposures  in  blinded  samples. 

Rapid  detection,  before  the  symptoms  appear  or 
even  at  various  stages  of  illness,  offers  the  opportunity  to 
initiate  appropriate  treatment.  Furthermore,  this 
technique  may  provide  the  means  to  identify  new 
therapeutic  approaches  to  ameliorate  the  devastating 
results  of  these  pathogens. 


2.  EXPERIMNETAL  APPROACH 

Our  laboratory  is  studying  the  complex  interaction 
between  various  biological  pathogens  and  the  host  in 
vivo  and  in  vitro  to  understand  molecular  characteristics 
of  illnesses  induced  by  biological  threat  agents  and  to 
identify  host  defense  strategies  and  by  which  they  are 
regulated. 

We  have  used  a  library  of  40,000  human  cDNA,  of  those 
10,000  are  human  sequence  verified  cDNA,  to  construct 
customized  microarray  chips  used  in  our  studies. 
Experiments  were  carried  out  in  replicates  at  each  time 
point  for  each  pathogen  using  cDNA  microarrays. 

RNA  isolation 

RNA  isolation  was  carried  out  paccording  to  the  Trizol 
™  Reagent  manufacturer’s  recommended  instructions. 
RNA  was  ethanol-precipitated,  air-dried  and  re¬ 
suspended  in  20ul/sample  of  nuclease-free  water.  RNA 
quantity  was  measured  via  spectrophotometry  followed 
by  analysis  with  a  Bioanalyzer  2100  (Agilent 
Technologies,  CA) 

Custom  made  cDNA  Microarray  SlidePreparation  and 
Hybridization 

The  gene  library  for  the  present  project  was 
commercially  obtained  from  Operon,  containing  -37  K 
genes.  Superamine  coated  Telechem  slides  (Telechem 
Inc.,  OR)  were  used  for  printing  the  cDNA  clones  using 
12X4  pin  format,  on  a  Virtek  chip  writer  professional 
microarray er  in  KemTek,  Inc,  MD.  The  printed  slides 
underwent  UV  cross-linking,  followed  by  post-processed 
by  succinic  anhydride  treatment.  The  Micromax™ 
Tyramide  Signal  Amplification  (TSA)™  Labeling  and 


Detection  Kit  (PerkinElmer,  Inc.,  MA)  was  used  as 
directed  by  the  manufacturer  to  determine  relative  gene 
expression  of  the  collected  samples.  Sample 
hybridization  was  carried  out  at  55°C  for  sixteen  hours. 

A  laser  detection  system  was  used  (GenePix  4000b, 

Axon  Instruments,  CA)  to  scan  the  finished  slides. 
Intensity  of  the  scanned  images  was  digitalized  through 
Genepix  4.0  software  (Axon  Inc.,  CA). 

Microarray  Analysis 

Data  cleansing  and  statistical  analysis  was  carried  out 
using  Genespring®  7.0  (Agilent  Tech.,  CA).  Local 
background  was  subtracted  from  individual  spot 
intensity.  Genes  that  failed  this  ‘background  check’  in 
any  of  the  eight  given  experiments  were  eliminated  from 
further  analysis.  Each  chip  was  next  subjected  to  intra¬ 
chip  normalization  (LOWES S).  The  genes  that  varied 
most  between  control  and  treated  sample  sets  were 
selected  via  /-test  analysis.  The  p-w alue  cutoff  was  set  at 
0.05.  Four  hundreds  and  thirty  two  genes  were 
differentially  expressed  between  VEE  -infected  and 
control  uninfected  animals  with  p<0.05. 

The  pattern  of  gene  expression  variability  of  the 
experimental  set  having  reduced  dimension  was 
evaluated  using  principal  component  analysis  (PCA) 
classifying  VEE  positive  and  negative  samples  as  the 
two  variable  classes. 

We  used  the  reference  design  where  a  reference  RNA 
sample  is  co-hybridized  with  each  sample  on  the  slide. 
This  design  allows  us  to  normalize  between  the  slide  for 
variations  that  can  be  due  to  hybridization,  transcription 
and  labeling  efficiencies  (technical  variations).  We  used 
various  modules  to  analyze  the  microarray  data  including 
GeneSpring,  Partek  Pro,  SAM  and  Bioconductor.  Using 
Analysis  of  Variance  (ANOVA)  we  determined  genes 
that  exhibited  variations  in  expression  between  the 
control  samples.  These  variations  may  be  due  to  many 
factors  including  biological  and  technical  variations. 
These  normally  varying  genes  are  excluded  from  further 
analysis  to  study  gene  regulation  upon  exposure  to 
pathogens. 

Selection  of  genes  (biomarkers)  and  predictive  analysis: 
Gene  Selection 

To  select  genes  for  use  in  the  predictor,  all  genes  are 
examined  individually  and  ranked  on  their  power  to 
discriminate  each  class  from  all  others,  using  the 
information  on  that  gene  alone.  For  each  gene,  and  each 
class,  all  possible  cutoff  points  on  gene  expression  level 
for  that  gene  are  considered  to  predict  class  membership 
either  above  or  below  that  cutoff.  Genes  are  scored  on 
the  basis  of  the  best  prediction  point  for  that  class.  The 
score  function  is  the  negative  natural  logarithm  of  the  p- 
value  for  a  hypergeometric  test  of  predicted  versus  actual 
class  membership  for  this  class  versus  all  others. 

A  combined  list  containing  the  most  discriminating 
genes  for  each  class  is  produced  as  the  predictor  list. 

Each  class  is  examined  in  turn,  and  the  gene  with  the 
highest  score  for  that  class  is  added  to  the  list,  if  it  is  not 


already  on  the  list.  Then  genes  with  the  next  highest 
scores  for  each  class  are  added.  This  is  continued  in 
rotation  among  the  classes  until  the  specified  number  of 
predictor  genes  is  obtained. 

We  applied  the  Fisher's  Exact  Test  that  looks  for  an 
association  between  expression  level  and  class 
membership.  Each  gene  is  tested  for  its  ability  to 
discriminate  between  the  classes.  Genes  with  the  lowest 
p- values  are  kept  for  the  subsequent  calculations. 

In  this  method,  all  the  measurements  for  a  given  gene  are 
ordered  according  to  their  normalized  expression  levels. 
For  each  class,  the  predictor  places  a  mark  in  the  list 
where  the  relative  abundance  of  the  class  on  one  side  of 
the  mark  is  the  highest  in  comparison  to  the  other  side  of 
the  mark.  The  genes  that  are  most  accurately  segregated 
by  these  markers  are  considered  to  be  the  most 
predictive.  A  list  of  the  most  predictive  genes  is  made  for 
each  class  and  an  equal  number  of  genes  (lowest  p-value 
using  Fischer's  exact  test)  are  taken  from  each  list. 
Classifying  the  Test  Samples 
Based  on  the  selected  genes,  classifications  are  then 
predicted  for  the  independent  test  data,  using  the  k- 
nearest-neighbors  rule.  A  sample  in  the  independent  set 
is  classified  by  finding  the  k  nearest  neighbors  of  the 
sample  among  the  training  set  samples,  based  on 
Euclidean  distance  between  the  normalized  expression 
ratio  profiles  of  the  samples.  The  class  memberships  of 
the  neighbors  are  examined,  and  the  new  sample  is 
assigned  to  the  class  showing  the  largest  relative 
proportion  among  the  neighbors  after  adjusting  for  the 
proportion  of  each  class  in  the  training  set. 

3.  RESULTS 

Study  of  host  responses  to  VEE  in  vitro  and  in  vivo  in 
NHP: 

We  have  used  microarrays  to  study  host  response  to  VEE 
in  vitro  using  human  PBMC  at  various  time  points  and  in 
vivo  using  NHPs  (Figure  2). 


Microarray  analysis  of  host  responses  to  VEE  in  vitro 
in  PBMC: 

We  carried  out  microarray  gene  expression  analysis  of 
PBMC  samples  exposed  to  VEE  at  various  time  points. 
We  identified  genes  that  exhibited  highly  statistically 
significant  differential  expression  in  PBMC  exposed  to 
VEE  at  various  time  points  when  compared  to  control 
untreated  cells  (Figure  3). 
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Figure  3:  Cluster  analysis  of  genes  that  were 
differentially  expressed  in  PBMC  exposed  to  VEE  at 
various  time  points  when  compared  to  control  untreated 
cells. 

Microarray  analysis  of  host  responses  to  VEE  in  vivo 
in  NHPs: 

We  carried  out  microarray  analyses  using  blood 
samples  obtained  from  12  control  (unexposed) 
NHPs  along  with  8  samples  from  VEE-exposed 
NHPs.  Although  no  signs  of  illness  were 
apparent  by  day  3,  gene  expression  patterns 
identified  these  NHP  as  having  mild  exposure 
to  the  virus. 

Genes  that  were  significantly  altered  by  VEE  were 
clustered  to  show  their  expression  profiles.  These  genes 
were  consistently  regulated  by  VEE  (figure  4). 


Figure  2:  scheme  of  the  experimental  design  for  t  he 
VEE  study. 
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Figure  4.  Cluster  view  and  expression  profiles  of  genes 
that  were  differentially  expressed  between  NHPs 
exposed  to  VEE  and  control  NHPs. 


We  carried  out  data  mining  and  functional  classification 
using  our  in-house  developed  tools  for  pathway  and 
literature  mining  showed  that  some  of  the  genes  that 
were  highly  up-regulated  by  VEE  were  involved  in 
apoptotic  pathways  and  cell  cycle  (Figure  5). 


Figure  5.  functional  analysis  of  genes  that  were  highly 
up  regulated  by  VEE  in  the  treated  NHPs. 

Use  of  predictive  modeling  and  feature  extraction  to 
identify  exposures  in  blinded  samples: 

To  estimate  the  level  of  confidence  in  the 
methodology,  we  used  our  microarray  data 
from  the  NHP  study  as  a  training  set.  The  NHP 
samples  from  12  control  and  5  VEE  challenged 
NHP  were  blinded  as  “test”  datasets. 

We  applied  principal  component  analysis  to  examine  the 
behavior  of  these  samples  and  plotted  the  PCA 
components  (Figure  6). 


Figure  6.  Principal  component  analysis  of  known  and 
blinded  sets.  The  blinded  set  is  shown  in  yellow,  the 
VEE  treated  samples  are  shown  in  blue  and  the  control 
untreated  samples  were  shown  in  red. 

We  used  the  k-nearest  neighbors  algorithm  with  a 
p<0.02.  We  identified  50  genes  that  can  be  predictors  of 
exposure  to  VEE. 

All  of  the  17  samples  were  correctly  categorized  as  to  the 
nature  of  the  exposure.  Using  both  types  of  training/test 
methods,  one  control  NHP  was  categorized  correctly  as  a 
“control”  but  had  at  least  some  indicators  of  common 


viral  exposure  (Table  1). 


Sample 

Predicted  Value 

p  value 

13256433_sampleA.gpr 
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0.00151 

13256434_sampleB.gpr 
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0.0113 

1 3250846_sampleC.gpr 
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0.00151 

1 3250847_sampleD.gpr 
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0.0113 

1 3250848_sampleE.gpr 

C 

0.208 

13250849_sampleF.gpr 
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0.00151 

13256857_sampleG.gpr 
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0.00151 

13256858_sampleH.gpr 
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0.00151 

1 3256859_samplel.gpr 
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0.0113 

1 3256860_sampleJ.gpr 
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0.0317 
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C 

0.00151 

1 3256854_sampleR.gpr 
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Identification  of  biomarkers  that  can  distinguish  VEE 
from  common  viruses: 

Venezuelan  equine  encephalitis  is  mosquito-bome  viral 
disease  characterized  by  early  flu-like  symptoms. 

To  identify  biomarkers  that  are  unique  for  VEE  and  can 
distinguish  between  exposure  to  VEE  and  other  common 
viral  agents,  we  studied  host  responses  to  representatives 
of  respiratory  and  arena  viruses  to  compare  the  host 
genome  regulation  to  that  of  VEE. 

We  carried  out  temporal  analysis  of  gene  expression  to 
the  2  groups  of  viruses  alongwith  VEE.  Group  #1 
contained  common  respiratory  viruses  (influenza  A, 
parainfluenza,  rhino  virus,  respiratory  synctical  virus). 
Group  #2  focused  on  5  arena  viruses  (the  highly  virulent 
hemorrhagic  virus,  Lassa,  and  2  additional  pairs  of 
virulent/avirulent  arena  viruses)  (figure  7). 
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Figure  7.  Experimental  design  of  the  study  carried  out  to 
identify  biomarkers  that  would  distinguish  VEE  from 
other  viral  agents. 


We  applied  a  PC  A  analysis  on  the  various  samples  and 
showed  that  the  host  response  to  VEE  have  some  unique 
profile  when  compared  to  the  arena  and  respiratory 
viruses  (Figure  8). 


Figure  8.  3-D  scatter 
plot  of  the  principal 
component  analysis 
applied  on  samples 
obtained  from 
PBMC  exposed  to 
VEE,  arena  and 
respiratory  viruses. 
Yellow:  VEE 
Red:  control  samples 
Blue:  Arena  and 
respiratory  viruses 


Y:  PCA  component  2  (1 1 .61  %  variance) 
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We  applied  ANOVA  followed  by  Benjamini  correction 
to  identify  genes  that  are  differentially  expressed 
between  VEE  and  the  other  2  groups.  We  identified  53 
genes  that  can  distinguish  exposure  to  VEE  from  arena 
and  respiratory  viruses  (figure  9). 


Figure  9.  Cluster  view  of  genes  that  were  significantly 
differentially  expressed  between  VEE  (yellow)  and  arena 
and  respiratory  viruses  (blue).  These  genes  were 
obtained  using  ANOVA  with  p<0.05  followed  by 
Benjamini  correction. 


1 3  (7.31 2%  variance) 


We  determined  the  expression  patterns  of  these  genes 
and  showed  that  they  can  be  used  as  biomarkers  for  VEE 
(figure  10). 
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study  as  a  test  set.  We  were  able  to  correctly  confirm  an 
exposure  to  VEE  in  15  out  of  17  samples  (table  2). 
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Figure  10.  Expression  patterns  of  genes  differentially 
expressed  between  VEE  and  Arena  and  respiratory 
viruses.  A.  Genes  colored  in  green  are  up  regulated  by 
VEE  and  down  regulated  by  the  arena  and  respiratory 
viruses  while  the  red  colored  genes  are  down  regulated 
by  VEE  and  up  regulated  by  the  arena  and  respiratory 
viruses.  B.  some  the  gene  are  plotted  to  show  patterns  of 
expression  in  arena  viruses  (blue),  respiratory  viruses 
(red)  and  VEE  (green). 

Use  of  in  vitro  data  to  predict  the  exposure  in  vivo: 


4.  CONCLUSION 

cDNA  microarray  studies  were  effectively  used  for 
“training”  datasets  to  address  the  following  questions:  1) 
Are  there  common  genes  modulated  under  both  NHP  in 
vivo  and  human  in  vitro  conditions  in  response  to  VEE 
exposure?  2)  Does  this  common  set  of  genes  permit 
sufficient  discrimination  of  a  test  data  set  based  on  an  in 
vivo  pathogen  exposure  against  a  database  derived  from 
in  vitro  exposures  to  several  pathogens  especially 
respiratory  and  arena  viruses?  Our  successes  in  correct 
identification  of  exposures  in  NHP  to  VEE  provide  a 
framework  for  further  work  in  validating  and 
differentiating  viral  exposures  from  other  pathogenic 
agents  using  blood  from  in  NHP  or  human  exposures. 
Furthermore,  new  technology  for  multiplexing  genomic 
responses  has  decreased  the  time  required  for  such 
analyses  to  nearly  “point  of  care”  applications.  This 
study  is  part  of  our  larger  quest  to  create  a  library  of  host 
gene  expression  responses  that  broadly  distinguish 
among  common  and  unconventional  pathogenic  agents, 
showing  severity  of  responses  via  a  molecular 
pathogenesis  blueprint  that  will  enable  us  to  design  novel 
methods  for  intervention.  Finding  a  small  number  of 
genes  that  can  accurately  classify  samples  is  useful 
because  these  predictor  genes  could  prove  essential  in 
designing  future  diagnostic  assays  for  early  detection 
upon  exposure  to  such  toxic  agents. 


To  determine  whether  these  data  can  predict  an  exposure 
to  VEE,  we  used  the  data  from  PBMC  treated  with  each 
of  the  viruses  (VEE,  arena  and  respiratory  viruses)  as  a 
training  set  and  the  data  obtained  from  NHP  in  vivo  VEE 
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VEE 

Dengue  1,  2,  3,  4 

WNV 

Influenza  A 

Parainfluenza  I 

Rhino 

RSV 

LCMY-WE 

LCMY-ARM 

Mopeia 

Mopeia-Lassa 


Systems  Used: 

In  vitro: 

Human  peripheral 
blood  mononuclear 
cells  exposed  to 
Biological  threat 
agents. 

In  vivo: 

Monkeys  exposed  to 
VEE,  LCMV-WE  and 
ARM 

Human  Volunteers  for 
Dengue  exposure 


•  Rapid  detection  of  exposure  to  biothreat  agents  is  essential 
so  that  initiation  of  treatment  can  begin  promptly. 

O  Gene  profiling  provides  potential  for  very  early  detection  strategies 

(for  classic  or  emerging/modified  threat  agents). 
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Genomic  Analyses  of  Host  Responses  to 
Venezuelan  Equine  Encephalitis 


•Alphaviruse  could  theoretically  be  produced  in  either 
wet  or  dried  form  and  stabilized  for  weaponization. 

•Incubation  Period:  1-6  days 
•Signs/Symptoms:  VEE  may  be  characterized  by 
sudden  onset  with  malaise,  high  fever  (101-105°F), 
severe  headache,  rigors,  photophobia,  myalgias 
(especially  in  the  legs  and  lumbosacral  area),  cough, 
sore  throat,  and  vomiting.  These  symptoms  may  be 
followed  by  a  prolonged  period  of  asthenia  and  lethargy. 
With  natural  infection  children,  and  rarely  adults,  may 
develop  encephalitis. 


Diagnosis 

•Differential  Diagnosis :  Cases  with  VEE  may  be  difficult 
to  distinguish  from  those  with  influenza.  Appearance  of 
cases  with  neurological  symptoms  may  suggest  the 
diagnosis.  Other  otential  bioterrorist  agents  that  cause 
flu-like  illnesses  ( Bacillus  anthracis,  Yersinia  pestis, 
Coxiella  burnetii)  need  to  be  considered. 

•Diagnostic  Tests :  Virus  may  be  isolated  from  blood  or 
throat  swabs  during  the  acute  phase  of  illness.  Blood 
may  be  collected  for  culture  in  a  tiger-top  (SST)  or  red- 
top  tube.  Throat  swabs  may  be  collected  for  culture  up 
to  5  days  after  onset  of  symptoms.  Cerebrospinal  fluid 
(CSF)  may  also  be  collected  for  culture  from  patients 
^/vith  neurologic  symptoms. 


Identification  of  biomarkers  for  VEE  to  identify 
exposures. 

Identification  of  signatures  for  VEE  compared  to 
common  viral  exposures. 


Identification  of  biomarkers  for  VEE  to  identify 
exposures. 
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Analysis  of  RNA  samples  Isolated  using  TriZol 
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Cluster  view  and  expression  profiles  of  genes  that  were  differentially 
expressed  between  NHPs  exposed  to  VEE  and  control  NHPs 


Functional  analysis  of  genes  that  were  highly  up  regulated  by  VEE  in  the  treated  NHPs. 
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To  evaluate  our  oligo  microarray  system  we 
were  asked  to  use  our  data  to  predict 
exposures  to  VEE  using  a  blinded  test  set.  We 
were  given  1 8  blinded  RNA  samples  from 
both  control  and  VEE  treated  NHP.  We  used 
known  control  and  VEE  samples  to  train  the 
algorithms.  We  used  our  whole  genome  oligo 
microarrays  (36K)  to  hybridize  the  training 
and  test  sets.  We  analyzed  the  microarray 
data  for  both  sets.  We  then  applied  predictive 
modeling  and  feature  extraction  algorithms  to 
predict  and  identify  exposures  to  VEE  in 
blinded  samples. 


;  y:  i-'ua  componeni  z  fi  u.y^1*  variance; 

■  i  i 


inded  set:  PCA  Using  all  genes  (~28K) 


sample  K 


sample  M 


sample  B 
sample  L 


sample  D  ,  . 
sample  I 


tr  _  Q 

component  3  (9.006%  Variance) 


axis:  PCA  component  1  (25.4%  variance) 
axis:  PCA  component  2  (1 0.92%  variance) 


sample  N 


sample?t»c"H 


sample  ^mp|0  r 


sample  C 


.X:  PCA  component  1  (25.4%  variance) 


Z-axis:  PCA  component  3  (9.0S6%  variance) 

Conditions:  Blinded  set,  Default  Interpretation 


m  Out  Zoom  Fully  Out  Display  Options 


View:  1 3D  Scatter  Plo 


Close  |  Help 


X-a 

xis: 

^CA  component  1  (34.42^ 

variance) 

Conditions: 

training  and  test,  Default  Interpretation 

Y-a 

Z-a 

xis: 

xis:  F 

DCA  component  2  (9.598^ 
DCA  component  3  (6.638^ 

variance) 
j  variance) 

Colored  by: 

Parameter  sample 

loom  Out 


Zoom  Fully  Out 


Display  Options... 


Close  |  Help  | 


start  S  0  fi 


1  £  j:  L=i 


m 


View:  1 3D  Scatter  I 


sample 


m  Out  Zoom  Fully  Out  Display  Options 


Close  |  Help 


View:  1 3D  Scatter  Plo 


Treatment 


Prediction  results 


Prediction  results 


Condition 

Treatment  T  ,  ID  495  ,  Day  14 
Treatment  T  ,  ID  573  ,  Day  14 
Treatment  T  ,  ID  479  ,  Day  3 
Treatment  T  ,  ID  476  ,  Day  3 
Treatment  T  ,  ID  494  ,  Day  4 
Treatment  T  ,  ID  498  ,  Day  4 
Treatment  C  ,  ID  492  ,  Day  C 
Treatment  C  ,  ID  494  ,  Day  C 
Treatment  C  ,  ID  479  ,  Day  C 
Treatment  C  ,  ID  573  ,  Day  C 
Treatment  C  ,  ID  157  ,  Day  C 
Treatment  C  ,  ID  192  ,  Day  C 
Treatment  C  ,  ID  476  ,  Day  C 
Treatment  C  ,  ID  476  ,  Day  C 
Treatment  C  ,  ID  479  ,  Day  C 
Treatment  C  ,  ID  494  ,  Day  C 
Treatment  C  ,  ID  498  ,  Day  C 
Treatment  C  ,  ID  498  ,  Day  C 
^Treatment  C  ,  ID  573  ,  Day  C 


Prediction 

P  value 

Arena  vote 

C  vote 

VEE  vote 

Respiratory  vote 

VEE 

0.038 

3 

0 

4 

3 

VEE 

0.00241 

0 

4 

6 

0 

VEE 

0.00515 

0 

2 

5 

3 

VEE 

0.0051 

1 

1 

5 

3 

VEE 

0.000448 

0 

2 

6 

2 

VEE 

0.111 

0 

5 

5 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

5.31E-05 

0 

8 

1 

1 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

C 

2.70E-08 

0 

10 

0 

0 

Genomic  Analyses  of  Host  Responses  to 
Venezuelan  Equine  Encephalitis 


Identification  of  signatures  for  VEE  compared  to  common 
viral  exposures 


Diagnosis: 

Differential  Diagnosis:  Cases  with  VEE  may  be  difficult  to  distinguish 
from  those  with  influenza.  Appearance  of  cases  with  neurological 
symptoms  may  suggest  the  diagnosis. 
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Benjamini 
correction. 
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Predictive  modeling  to 
identify  genes  signatures  of 
exposure  to  VEE  in  vitro. 
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predicted  correctly. 
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points  and  the  VEE  in  vitro 
samples  as  a  test  set. 

The  Fisher's  Exact  Test 
method  was  used  to  select 
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Expression  patterns  of  genes  selected  to  be  unique  signatures  for  VEE 


